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Abstract 

In this paper we have proposed a new test for pixel randomness using non-parametric method 
in statistics. In order to validate this new non-parametric test we have designed an encryption 
scheme based on 2D cellular automata. The strength of the designed encryption scheme is first 
assessed by standard methods for security analysis and the pixel randomness is then determined 
by the newly proposed non-parametric method. 
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1 Introduction 

Multimedia images are widely used in internet communications, so the need for securely transmit¬ 
ting these images over networks has become an essential part of the field of data security. Image 
cryptography plays a vital role in securing confidential images. The purpose of image cryptography 
is to hide the content of the images by encrypting them so as to make the images unrecognizable 
to the intruders. One part of this paper deals with a new image cryptosystem using 2D Cellular 
Automata. 


In general, there are two different methods to protect an image; they are (i) image shuffling and (ii) 
image encryption. Pixels positions are rearranged in image shuffling whereas in image encryption, 
pixel values and positions are changed. In both the cases it is very essential to check the security of 
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Figure 1: von-Neumann and Moore Neighborhoods 


the method. That means the method should be invulnerable to all attacks. Poorly protected images 
will always provide information about the original image in statistical analysis. If the encrypted 
image is indistinguishable from the random image, statistical analyses will not have advantage to 
break. So testing the randomness in the pixels of encrypted image is the state of the art. In the 
literature already there are different tests for checking randomness for ID datafU]. A number of 
parametric tests are designed for pixel randomness in shuffled and encrypted images [71 dSl dSl- A 
non-parametric test is developed in this paper for checking randomness in the image pixels, which is 
the first of its kind. 

John von Neumann proposed a new emerging concept called Cellular Automata (CA)[T2]. CA is a 
discrete model consisting of regular grid of cells, each in one of the finite number of states. According 
to some fixed rule, the state of each cell will be changed in terms of the state of the current cell and 
the states of the cells in its neighborhood. Like ID, higher dimension CA also can be defined [H]. It 
is already proved that some of the rules of CA will be able to generate complex random patterns [Hj. 
In the last two decades ID and 2D CAs are used in Cryptography. Lot of research is going on for 
CA based image cryptography mmm- The paper is organized as follows: Section 2 describes the 
basics of 2D cellular automata and the 2D CA concepts used for our encryption scheme. Section 3 
presents our new non-parametric test for pixel randomness. Section 4 describes our 2D CA based 
encryption scheme. Section 5 presents the simulation results and performance evaluations. Section 
6 gives the concluding remarks. 


2 2D Cellular Automata 

The extensional behavior of ID Cellular Automata is also able to produce complicated patterns in 
two-dimension. This extension is significant since this is compared with pattern formation in physical 
systems. 2D CA is a regular 2D lattice of cells. Each cell has n possible values and is updated in 
each discrete time steps according to a rule f that depends on the value of sites in some neighbor 
around it. There are different types of lattices and neighborhood structures in a 2D CA. Figure 1 
shows the two familiar neighborhood structures named as von-Neumann neighborhood and Moore 
neighborhood. The value of Ojj of a cell at (z,j)th position in {t + l)th time in a 2D CA with 
a rule f, that depends only on the cells according to von-Neumann neighborhood is evolved from 
^ f((1^ (1^ (1^ 1 
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2.1 Mathematical Model 


ID and 2D CA are studied differently with the help of polynomial algebra and matrix algebra by re¬ 
searchers. P.P.Choudary et.ahalready have studied and designed a new characteristic of 2D CAjHin]. 
These concepts are used for our proposed encryption method. 2D nine neighborhood Moores neigh¬ 
borhood is used here for evolutions. Different rules are defined in terms of dependencies from the 
following rule convention. 
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Here the central box is the cell being considered for evolution and the other boxes are its neighbors. 
The numbers in the boxes are used to specify the boxes for a particular rule. For example, rule 
2 characterizes the dependency on its right neighbor whereas such dependency only on its bottom 
neighbor is characterized by rule 8 and so on. Rules, which are having dependency only on one cell 
are called fundamental rules. Suppose a cell has dependency on more than 2 neighbors. Then the 
rule number will be the sum of the numbers of the corresponding cells. For example 2D CA rule 15 
(l-|-2-|-4-|-8) refers to the bottom-right 2x2 cells. There are 512 such rules including zero dependency. 


These 512 rules are classified in to 9 different groups in terms of the number of Is in the binary 
representation of the rule number. For example rule 8 belongs to Group 1, rule 320 belongs to Group 
2 etc.. 


2.2 Uniform CA Rules 


Evolution of such 2D CA rules can be considered as matrix operations with Os and Is. If same rule 
is applied to each entry of the given problem matrix, then the CA is called uniform CA. Example 2.1 
shows the application of the rule 15 in the problem matrix of size (2x3) with null boundary condition. 


Example 2.1: 


/ 1 0 0 
y 0 1 0 


Rulel5 
-^ 


0 1 0 
1 1 0 


Let St denote the binary matrix which represents the current state of a 2D CA. The next state of 
a cell is obtained by calculating XOR operation of the states in the corresponding boxes of a rule. 
In this example the states in the dependent neighbors of the 1st cell are 1,0,0,1 according to rule 15 
and its XOR value 0 is its next state value. 


To apply the rules to the problem matrix, one can use the rule matrices which are used in . Eor a 
problem matrix of size (m x n), the size of the rule matrix Rn is (mn x mn) since the problem matrix 
is applied as a vector of order (mn x 1) got from the row major order. 

Structure of rule matrices: 


3 








Let Rn denote the rule matrix for rule n, for n=0,1, 2,511. It is found that these rule matri¬ 
ces are related to each other in the following way. 

Ri —>■ Identity matrix. 

(i?2)^ = i?32, {R4)^ = R&A, (Rs)^ = -Ri28) = -^256 ) 

where the power T stands for the transpose of the matrix. By having these 5 fundamental matrices, 
one can easily get the rule matrices of all other rules using transpose and the XOR operations as 
given in example 2.2. 

Example 2.2: 


Ri 5 — Rg Ra R2 + Ri- 

Rs 2 = RgA + Ri 6 + R2 = (-^4)^ + Ri 6 + R 2 - 

R327 — R256 "I" Rga + Ra + R2 "I" Ri — {Rie)^ + Rga "I" Ra "I" R2 "I" Ri■ 


In [Hj P.P.Choudary et. ah have found new way of constructing rule matrices using the following 
two binary sequences Si and S 2 . 

51 = m^om^o]d^o..o]d^ 

That is Si starts with (n-1) I's followed by one 0 then (n-1) I's followed by one 0 and finally ends 
with (n-1) 1 s. 

5 2 = 0]d^0]d^0]d^0..0]d^0 

That is S 2 starts with one 0 followed by (n-1) I's and one 0 then (n-1) I's and ends with (n-1) I's 
followed by one 0, where n is the number of columns in the problem matrix. 


The five basic matrices are defined as follows with the structure of above sequences. 

Ri = Identity matrix. 

R 2 = (n-l)th diagonal elements are arranged like sequence Si and all other elements are 0. 

Ra = (n-|-l)th diagonal elements are arranged like sequence 5'i and all other elements are 0. 

Rg = nth diagonal elements are 1 and all other elements are 0. 

Riq = (n-l)th diagonal elements are arranged like sequence S 2 and all other elements are 0. 

Other rule matrices are defined as in example 2. Among all these 512 rules we are interested only in 
rules with non-singular matrices as they are reversible which are useful in Cryptography. There are 
only 31 rules that are reversible and those rules are expressed in terms of Ri since they have 1 s in 
their diagonal. 


2.3 Hybrid CA Rules 

Unlike uniform CA, in hybrid CA each cell has its own local rule, that is, different rule is used for 
different cell. The total number of matrices of order (mn x mn) with entries 0 and 1 is , whereas 

we dealt with only 512 rules which are corresponding to linear Boolean rules. The rule matrices of 
hybrid rules we are discussing here are corresponding to remaining -512 matrices. 


4 


Example 2.3: 


Let us consider the hybrid rule which defines as follows for a 2x2 problem matrix. 

1. 1st cell changes its state by looking at the valnes in 1st, 3rd and 4th positions of the problem 
matrix. 

2. 2nd cell changes its state by looking at the values in 2nd and 3rd positions of the problem matrix. 

3. 3rd cell changes its state by looking at the values in the 4th position of the problem matrix. 

4. 4th cell changes its state by looking at the valnes in the 1st, 2nd and 3rd position of the problem 
matrix. 



All such non-singular matrices of order (mn x mn) 
raphy purposes. 


0 11 ^ 

1 1 0 
0 0 1 
1 1 0 y 

are reversible and they can be used for cryptog- 


3 A new Pixel Randomness Algorithm 

Strength of an image encryption method measured by showing the randomness in the pixels of the 
shuffled and encrypted images. In literature we already have some parametric tests, which are used 
for checking randomness in the pixels of the images. But these tests are based on the assnmption 
that the observations are treated as samples drawn from normal popnlations. If this is the case, 
these methods extract all the information that is available in a sample, and they usually attain the 
best possible precision. However, since there are many situations where it is doubtful whether the 
assumption of normality can be met, so the non-parametric tests came in to picture. It is useful 
to have a technique for testing whether a sample may be looked upon as random after it has been 
obtained. One such technique is based on the order in which the sample values were obtained, more 
specifically, it is based on the nnmber of runs exhibited in the sample results [6]. 


A process which produces independent and identically distributed (i.i.d) samples is called a ’’truly 
random process” in Statistics. A process is not considered truly random if the value in the sequence 
can be determined by its position. The property of iid is eqnivalent to the randomness, and it 
forms the basis for many statistical tests. Many practical applications will not have true random 
observations, but these simple tests are still useful to check how far the observations being close to 
randomness. It has very good application in cryptographic security, where it is used to check the 
randomness in various random number generators. 


Investigations of randomness of a given sequence often require statistical tools for distribution 
comparison. Among them, goodness-of-fit tests and entropy estimates are two well-understood 
concepts[IH]. However, when the distribution of the observed data is unknown, , we have to go 
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for non parametric tests, using some distribution invariant properties of random process. For ex¬ 
ample, the observations are transformed to a sequence of symbols that can specify their relative 
positions or magnitudes. The pattern of this sequence will serve as a measure of the randomness of 
the original process. 


A run is defined as a succession of one or more identical symbols in the ordered symbol sequence. 
For example, the Os and Is in a bit string such as 0000011111 and 0011001101 have 2 and 6 runs 
respectively. The number of runs and their sizes are used as a measure of the randomness of the 
observations. In a truly random sequence, we very rarely encounter too few, too many runs or runs 
with long length, etc., So the presence of any of these in an ordered symbol sequence is used as a 
statistical criteria for rejecting null hypothesis. Also these criteria are related to each other as too 
few runs results in some long length runs and too many runs results in some short length runs. So 
the number of runs is considered as a useful parameter in the randomness test. 


Dichotomizing criteria is used for symbolizing the sequence for quantitative observations. By com¬ 
paring each number in the sequence to a focal point (mean or median), it can be symbolized as -|- 
or depending on whether the number is greater or lesser than the focal point. Also, the trend or 
autocorrelation of the sequence can be determined from the relative magnitudes or ranks of adjacent 
numbers. 


If a sequence contains rii symbols of one kind and n 2 of another kind, the sampling distribution of 
the total number of runs, u, can be approximated closely by a normal distribution with 




2nin2 
ni -\- n2 


+ 1 


and 


I2nin2{2nin2 — ni — 7 ^ 2 ) 
(ui -F n2)^(ni -I- 7^2 - 1) 


( 1 ) 


Thus the test of the null hypothesis i^olhat the arrangement of the symbols is random, can be based 
on the statistic 


Z = 


U- jXu 


( 2 ) 


which has approximately the standard normal distribution. 

In order to carry out the test for randomness, we set Hq as arrangement is random against the 
alternating hypothesis Hi as arrangement is not random. By setting the level of significance as a, 
we make use of the following criterion for the critical region. U z < —Zai 2 or z > Zai 2 then reject 
Hq and then we conclude the result as arrangement is random or not with (1 — a) x 100%level of 
confidence. We take a — 0.01 and make the conclusion about randomness with 99% of confidence. 
One can see from the standard normal distribution table that Zai 2 = 2.575. 


In our method the runs are formed in the following manner: First, t distinct pixels are selected 
randomly from the image for which the pixel randomness is to checked. Now for each of these t 
pixels consider its Moore’s neighborhood pixels including the center pixel. Find the mean value of 
these 9 pixels for all the t pixels. Now we have t number of mean values. Now move horizontally (or 
vertically or diagonally) two steps from each of these t pixels, again we have now t points and find 
the t number of Moore’s neighborhood mean values for these t pixels. Now we have totally 2t mean 
values. Do the same process h number of times, and we will have ’ht’ mean values. Find the median 


6 



for all these ht mean values. Compare this median value with each of these ht mean values, if the 
mean value is less than the median then make the run entry as a otherwise b. Now we will have a 
sequence of a s and b s. This sequence is now exhibiting the runs. With these runs, the test has to 
be done using the equations 1 and 2. Moving along horizontal(or vertical or diagonal) direction is 
required to consider the distribution of pixels focally. The following algorithm explains the test. 


Algorithm: 

Notations: 

• P is an image to be tested. 

• N is the number of tests. 

• T is the number of evaluations. 

• n is the number of random pixels from P. 

• h is the number of horizontal or vertical movements. 


1 . 

2 . 

3. 

4. 

5. 

6. 

7. 

8. 

9. 

10 . 
11 . 
12 . 

13. 

14. 

15. 

16. 

17. 

18. 

19. 

20 . 
21 . 
22 . 

23. 

24. 

25. 

26. 

27. 

28. 
29. 


I = 1 . 

for i=l to T 

Cl = 0 

for j=l to N 

Choose n random pixels (pi,P2)P3) ..pn)without repetition, 
for t=l to h 

Compute n number of means (ruti, mt 2 , rnts, ..mt„)from each pixel’s Moore’s neighborhood. 
Move 2 pixels horizontally (or vertically or diagonally), 
end for 

Find the median d for the hn means, 
for s=l to h 
for t=l to n 

> d) 

else 

end if 
end for 
end for 

Count the number of I’s and O’s in r and denote as ni and n2 respectively. 

Count the number of runs u in r. 

Compute fj^u and au using 1. 

Perform the test using the statistic 2. 
if {-Za/2 <Z< Z^I2) 

Cl = Cl + 1. 
end if 
end for 
C2[l] = Cl. 


30. end for 

31. C=((ELiC2[A:])/T)*100. 

C measures the image pixel randomness in the range 1-100. 
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4 Encryption Scheme 


The new 2D CA based encryption scheme in this paper is considered for gray scale and color images. 
Basic idea of the proposed image encryption, decryption method is based on the idea in [3j. Pixel 
values are changed by considering its binary representation using 2D CA. Encryption process has 
been done in two parts. In the first part, image scrambling is performed on the binary string form of 
the image and in the second part this scrambled image is been encrypted using 2D CA by considering 
the problem matrix of size (5x5). 

4.1 Encryption Algorithm 

This scheme can be applied on any square size problem matrix. We have implemented the scheme 
for the problem matrix of size (5 x 5).If the plain image size is mxn then the size of the plain image 
in binary form is mx8n. Pad rows and columns with Os in the binary form such that the number of 
rows and columns are multiple of 5. Let the size of this matrix be {pi x ^ 2 )- Perform the permutation 
on the columns and rows of this matrix. This scrambled image is now considered as the input image 
for the 2D CA encryption scheme. Extract the (5 x 5) sub matrices in row major order from the 
scrambled binary image of size {pi x P 2 ). Now we have (y x y) array P of (5 x 5) sub matrices. 
Consider each (5 x 5) sub matrix as a problem matrix and apply the proposed 2D CA scheme. Rules 
with invertible rule matrices are used for encryption purpose. 

Encryption can be done in the following different ways: 

Wl: All (5 X 5) problem matrices in P use the same invertible uniform rule. 

W2: Each row (column) of P uses different invertible uniform rules. 

W3: Each problem matrix in P uses a different invertible uniform rule. 

W4: All (5x5) problem matrices in P use the same invertible hybrid rule. 

W5: Each row (column) of P uses different invertible hybrid rules. 

W6: Each problem matrix in P uses a different invertible hybrid rule. 

As we move from Wl to W6, the size of the key space is increased. After following any one from 
the above 6 ways, we will get the encrypted image. For the RGB color image, the encryption has 
to be done on each component separately. For the sake of increasing the difficulty of cryptanalysis, 
one can always follow W3 or W6 for encryption. The following algorithm explains the encryption 
process in Wl. 

Algorithm: 

Get the matrix P from the binary scrambled plain image by clubbing (5x5) submatrices in row major order. 
Input: Matrix P of size (mxn) of (5x5) submatrices. 

Output:Matrix C of size (mxn) of size(5x5) submatrices. 

1. Choose an invertible uniform rule r. 

2. Construct the rule matrix R of size (25x25). 

3. for i=l to m 

4. for j=l to n 

5. Get the vector P\i^j) of size (25x1) from P{i^j) by ordering the elements in row major order. 

6. C'{i^j) = i? * P'{i^j). (where * is matrix multiplication in Z 2 ). 

7. Get the matrix C{i^j) of size (5x5) from by arranging the elements in row major order. 

8. end for 

9. end for 

We will get the encrypted image from the gray scale representation of the re-ordered binary matrix 

C. 



4.2 Decryption 


As we have used invertible uniform and hybrid rules for encryption, inverse exists for all the encryp¬ 
tion rnles. So the decryption can be done using the inverse rule matrices of the encryption rnles 
matrices. For the color images, decryption can be done on the three components separately using 
the inverse rule matrix (matrices) of the corresponding encryption rule matrix (matrices). 


5 Simulation Results and Performance evaluation 

Several simulations are conducted to test various properties of the proposed 2D CA image encryp¬ 
tion/decryption scheme including information concealing, confusion and diffnsion properties. For 
the gray scale pepper image and the RGB color Lena images, we have performed all the ways of 
encryption which are mentioned above. Figure 2 shows the experimental results of the encryption 
scheme, (a) shows the gray scale Pepper image, (b),(c) and (d) show the encrypted images of (a) 
by Wl, W2 and W3 respectively, (e) shows the RGB color Lena image, (f), (g) and (h) show the 
encrypted images of (e) by W4, W5 and W6 respectively. 



Figure 2: (a) Original Pepper Image; (b) Encrypted (a) by Wl; (c) Encrypted (a) by W2; (d) 
Encrypted (a) by W3; (e) Original Lena Image; (f) Encrypted (e) by W4; (g) Encrypted (e) by W5; (h) 

Encrypted (e) by W6. 


5.1 Information concealing 

If the key in the receiver side is identical with the secret key, the image can be revealed withont loss. 
Figure 3 shows the results of our experiments with gray scale House image and the Lena color image, 
in both the experiments we could get back the original images using the correct secret key. In the 
figure 3, (a) and (d) are the original images, (b) and (e) are the encrypted images and (c) and (f) 
are the decrypted images. 
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Figure 3: (a) Original House Image; (b) Encrypted House Image by W3; (c) Decrypted House Image; 
(d) Original Lena Image; (e) Encrypted Lena Image by Wl; (f) Decrypted Lena Image. 


5.2 Confusion Property 

It is always essential to make the relationship between the key and cipher text as complex as possible 
so that the attackers can not get the key from the cipher text. Histograms of the original and 
encrypted images of the above experiments are given in Figure 4. The histograms of the encrypted 
images are almost uniform regardless of the original images and are significantly different from those 
of the original images. The results exhibit the confusion property of the proposed scheme. 


5.3 Diffusion Property 

In any strong cryptosystem, key plays the main role to spread out the redundant bits in the plain 
image over the cipher image along with the encryption method. So small changes in the key, produces 
a significantly different cipher image. This property is termed as diffusion property. We have 
performed encryption and decryption key sensitivity tests to show the diffusion property in our 
method. Figures 5 & 6 show the test results of encryption and decryption key sensitivity analysis 
respectively. These results clearly exhibit the diffusion property. 


5.4 Statistical Analysis 

5.4.1 Correlation Analysis 

In a perfectly encrypted images and in a perfectly shuffled images, any pair of adjacent pixels along 
horizontal, verical or diagonal directions should not be correlated. 1000 random pairs of adjacent 
pixels have been chosen horizontally, vertically and diagonally from the plain and encrypted images 
for the correlation analysis. The correlation coefficient has been computed between two adjacent 
pixels using the equation 6. 
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( 4 ) 


i=l 

1 ^ 

cov(x, y) = ^ ~ ^(^))(^* “ ^(^)) (^) 

i=l 

^ cov(x, y) 

\/D(x)^D(y) 

Table [1] gives the correlation coelRcient values of adjacent pixels in the Lena and Pepper images 
along horizontal, vertical and diagonal directions of the plain and encrypted images. This confirms 
that the adjacent pixels in the plain images are strongly correlated where as the adjacent pixels in 
the encrypted images are weakly correlated. Figure 7 illustrates the correlation distribution of the 
plain and encrypted Lena, Pepper as well as House images. Correlations along horizontal, vertical 
and diagonal directions are tested for Lena, Pepper and House images respectively. We can clearly 
see that the encrypted images are very weakly correlated. 


Images 

Horizontal 

Vertical 

Diagonal 

Original Lena Image 

0.9746 

0.9381 

0.9438 

Encrypted Lena Image 

-0.0184 

0.0352 

0.0228 

Original Pepper Image 

0.9143 

0.9403 

0.9552 

Encrypted Pepper Image 

0.0110 

0.0065 

-0.0104 


Table 1: Correlation coefiicients between adjacent pixels for original and encrypted images. 


5.4.2 Non-Parametric Test for Pixel Randomness 

Evaluation results of the proposed non-parametric test for the house images are given in figure 8. 
Scores of the other tests images are given in the table 2. As one can see that the perfectly encrypted 
and perfectly shuffled images have high scores. In contrast the original image has very low score 
and the partially shuffled and encrypted images have average scores. Scores of the different images 
are varied according to their pixel randomness. Images with high scores are indistinguishable from 
random patterened images. 


Images 

Original 

Partially Shuffled 

Partially Encrypted 

Perfectly Shuffled 

Perfectly Encrypted 

Lena Image 

1.3 

65.4 

65.8 

97.6 

98.8 

Boat Image 

2.1 

61.2 

65.7 

98.6 

97.5 

Pepper Image 

2.6 

66.3 

67.4 

97.4 

98.5 


Table 2: Scores of Test Images. 
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Original Encrypted 



(b) 





score:2.3 score:67.3 score:64.4 score:95.5 score:96.2 

Figure 8: (a) Original (b) Partially Shuffled (c) Partially Encrypted (d) Perfectly Shuffled (e) Perfectly 

Encrypted 


6 Conclusion 


We have introduced a new test using non-parametric method in statistics for measuring the random¬ 
ness in the image pixels. It scores the images, depending upon how far their pixels are i.i.d. The 
newly designed 2D CA based encryption scheme first analysed by standard evaluation methods and 
then the encrypted images are used to validate our newly proposed non-parametric test. Simulation 
results clearly show that this test can be used for evaluating the quality of any image shuffling and 
image encryption method.The complexity of the method is 0{NThn), so it can be efficiently imple¬ 
mented. In the proposed non-parametric test we have only used the basic run test to measure the 
randomness. 
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